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Sequence Comparison A 



RESULT 1 
US-08-253-155A-12 

Sequence 12, Application US/08253155A 
Patent No. 5691147 
GENERAL INFORMATION: 

APPLICANT: Gyuris, Jeno 
APPLICANT: Draetta, Giulio 
TITLE OF INVENTION: CDK4 Binding Proteins 
NUMBER OF SEQUENCES: 95 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 
STREET: 60 State Street 
CITY: Boston 
STATE : MA 
COUNTRY : USA 
ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE : ASCII (text) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/253 , 155A 
FILING DATE: 02-JUN-1994 
CLASSIFICATION : 435 
ATTORNEY /AGENT INFORMATION: 
NAME: Vincent, Matthew P. 
REGISTRATION NUMBER: 36,709 
REFERENCE/DOCKET NUMBER: MII-028 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617) 227-7400 
TELEFAX: (617) 227-5941 
INFORMATION FOR SEQ ID NO: 12: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1075 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-253-155A-12 



Query Match 21.6%; Score 890; DB 1; Length 1075; 

Best Local Similarity 96.2%; Pred. No. 2.1e-237; 

Matches 975; Conservative 1; Mismatches 31; Indels 7; Gaps 

Qy 2918 tcaccatcttcggccctgtcaacaacagcaccatgaaaatcgaccactttcagctagata 2977 

II II llllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 44 TCTGCAAGTTCGGCCCTGTCAACAACAGCACCATGAAAATCGACCACTTTCAGCTAGATA 103 

Qy 2978 atgagaagcccatgcgagtggtggatgatgaagacttggtagaccagcgtctcatcagcg 3037 

llllllllllllllllllllllllllllllllllll IIIIIIMIIIIIIIIIIIIIII 

Db 104 ATGAGAAGCCCATGCGAGTGGTGGATGATGAAGACTGGGTAGACCAGCGTCTCATCAGCG 163 

Qy 3038 agctgaggaaagagtacggaatgacctacaatgacttcttcatggtgctaacagatgtgg 3097 

llllllllll lllllllll lllllllll III lllllllllllllllllllllllll 
Db 164 AGCTGAGGAAGGAGTACGGATTGACCTACACTGA-TTCTTCATGGTGCTAACAGATGTGG 222 

Qy 3098 atctgagagtcaagcaatactatgaggtaccaataacaatgaagtctgtgtttg-atctg 3156 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiMiiii mil 

Db 223 ATCTGAGAGTCAAGCAATACTATGAGGTACCAATAACAATGAAGTCTGTGTTTGCATCTG 282 
Qy 3157 atcgatactttccagtcccgaatcaaagatatggagaagcagaagaaggagggcattgtt 3216 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

Db 283 ATCGATACTTTCCAGTCCCGAATCAAAGATATGGAGAAGCAGAAGAAGGAGGGCATTGTT 342 
Qy 3217 tgcaaagaggacaaaaagcagtccctggagaacttcctatccaggttccggtggaggagg 3276 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 
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Qy 
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Qy 
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463 


Qy 


3397 


Db 


522 


Qy 


3457 
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582 


Qy 


3517 


Db 


641 


Qy 


3577 


Db 


701 


Qy 


3637 


Db 


761 


Qy 


3697 


Db 


821 


Qy 


3757 


Db 


881 


Qy 


3817 


Db 


941 


Qy 


3876 


Db 


999 



343 TGCAAAGAGGACAAT^GCAGTCCCTGGAGAACTTCCTATCCAGGTTCCGGTGGAGGAGG 4 02 

aggttgctggtgatctctgctcctaacgatgaagactgggcctattcacagcagctctct 3336 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIII 

AGGTTGCTGGTGATCTCTGCTCCTAACGATGAAGACTGGGCCTATTCACAGCAGCTCTCT 462 

gccctcagtggtcaggcgtgcaattttggtctgcgccacataaccattctgaagctttta 3396 

llllllllllllllllllllllllll llllllllllllllllllllllllllll INI 

GCCCTCAGTGGTCAGGCGTGCAATTTGGGTCTGCGCCACATAACCATTCTGAAGC-TTTA 521 

ggcgttggagaggaagttgggggagtgttagaactgttcccaattaatgggagctctgtt 34 56 

llllllllllllllllllllllllll lllllllllllllllllllllllllllllllll 

GGCGTTGGAGAGGMGTTGGGGGAGTCTTAGAACTGTTCCCAATTAATGGGAGCTCTGTT 581 

gttgagcgagaagacgtaccagcccatttggtgaaagacattcgtaactattttcaagtg 3516 

llllllllilllllllhllllllllMI llllllllllllllllllllllllllllll 



agcccggagtacttctccatgcttctagtcggaaaagacggaaatgtcaaatcctggtat 3576 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGCCCGGAGTACTTCTCCATGCTTCTAGTCGGAAAAGACGGAAATGTCAAATCCTGGTAT 700 

ccttccccaatgtggtccatggtgattgtgtacgatttaattgattcgatgcaacttcgg 3636 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CCTTCCCCAATGTGGTCCATGGTGATTGTGTACGATTTAATTGATTCGATGCAACTTCGG 760 

agacaggaaatggcgattcagcagtcactggggatgcgctgcccagaagatgagtatgca 3696 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
AGACAGGAAATGGCGATTCAGCAGTCACTGGGGATGCGCTGCCCAGAAGATGAGTATGCA 820 

ggctatggttaccatagttaccaccaaggataccaggatggttaccaggatgactaccgt 3756 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIMI 

GGCTATGGTTACCATAGTTACCACCAAGGATACCAGGATGGTTACCAGGATGACTACCGT 880 
catcatgagagttatcaccatggatacccttactgagcagaaatatgtaaccttagactc 3816 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

CATCATGAGAGTTATCACCATGGATACCCTTACTGAGCAGAAATATGTAACCTTAGACTC 94 0 
agccagtttcctctgcagctgctaaaactacatgtggccagctccattcttccacact-g 3875 

I I IIIIIIIIIIIIIIIII mill iiiiiiiiiiiiiiiiiiiiiiii I 

A - - CCATTTCCTCTGCAGCTGCTGGAACTACGATTGGCCAGCTCCATTCTTCCACACTGG 998 
cgtactacatttcctgcctttttctttcagtgtttttctaagactaaataaata 3929 

lllllllllllllll llllllllllll I INI I I IMIII 

GGTACTACATTTCCTGGCTTTTTCTTTCAAGGGTTTTTCTTTAGGACTTAAATA 1052 




SUMMARIES 
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RESULT 9 
AAW19990 

ID AAW19990 Standard; Protein; 320 AA. 
XX 

AC AAW19990; 
XX 

DT 27-AUG-1997 (first entry) 
XX 

DE Type I, p80 IL-l-receptor intracellular domain ligand. 
XX 

KW IL; interleukin; receptor; ligand; screening assay; inhibitor; 

KW IL-1 mediated response; inflammation; inflammatory; antibody; 

KW intracellular domain; CANP; calcium activated neutral protease, 
XX 



OS Homo sapiens. 
XX 

PN WO9640907-A1. 
XX 

PD 19-DEC-1996. 
XX 

PF 06-MAY-1996; 96WO-US06363 . 
XX 

PR 07-JUN-1995; 95US-0487942 . 
XX 

PA (GEMY ) GENETICS INST INC. 
XX 

PI Graham J, Lin L; 
XX 

DR WPI; 1997-052315/05. 

DR N-PSDB; AAT71217 . 
XX 

PT Interleukin-1 receptor intracellular ligand proteins and, related DNA 

PT - used to identify inhibitors of the proteins for treatment of 

PT inflammation 
XX 

PS Claim 3; Page 32-33; 54pp; English. 
XX 

CC AAW19990 represents a protein that has interleukin-1 receptor (IL-l-R) 

CC intracellular ligand activity. IL-l-R intracellular ligand proteins 

CC are used to screen for agents (e.g. antibodies) that are capable of 

CC inhibiting or blocking the binding of an IL-l-R intracellular ligand 

CC to the intracellular domain of IL-l-R, i.e. inhibitors of IL-1 

CC activity. Such agents can be used to treat inflammatory conditions. 

XX 

SQ Sequence 320 AA; 



Query Match 27.1%; Score 1343.5; DB 18; Length 320; 

Best Local Similarity 81.4%; Pred. No. 1.6e-87; 

267; Conservative 0; Mismatches 0; Indels 61; Gaps 

KKGGKTEQDGYQKPTNKHFTQSPKKSVADLLGSFEGKRRLLLITAPKAENNMYVQQRDEY 64 5 

MIIMIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 

kkggkteqdgyqkptnkhf tqspkksvadl Igs f egkrrl 11 it apkaennmyvqqrdey 60 
LESFCKMATRKISVITIFGPVNNSTMKIDHFQLDNEKPMRWDDEDLVDQRLISELRKEY 705 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

lesf ckmatrkisvitif gpvnnstmkidhfqldnekpmrwddedlvdqrliselrkey 120 

GMTYNDFFMVLTDVDLRVKQYYEVPITMKSVFDLIDTFQSRIKDMEKQKKEGIVCKEDKK 765 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
gmtyndf f mvl tdvdlrvkqyyevpitmksvf dl idt f qsr ikdmekqkkegivck 176 



III 

-eev 179 
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Qy 


886 
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240 



GGVLELFPINGSSWEREDVPAHLVKDIRNYFQVSPEYFSMLLVGKDGNVKSWYPSPMWS 885 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ggvlelfpingssweredvpahlvkdirnyfqvspeyf smllvgkdgnvkswypspmws 23 9 

MVIVYDLIDSMQLRRQEMAIQQSLGMRC 913 

IIIIIIIIIIIIIIIIIIIIIIIIIMI 
mvivydlidsmqlrrqemaiqqslgmrc 267 



RESULT 11 
AAR90544 

ID AAR90544 standard; protein; 278 AA. 
XX 

AC AAR90544; 
XX 

DT 08-AUG-1996 (first entry) 
XX 



DE pJG4-5-CDK-BP clone #12 5 derived CDK4 binding protein. 
XX 

KW Cell cycle; CDK4 ; regulation; Gl phase; proliferation; tumourigenesis; 

KW cyclin dependent kinase; differentiation; CDK4 inhibitor; agonist; 

KW antagonist . 
XX 

OS Synthetic. 
XX 

PN W09533819-A2. 
XX 

PD 14-DEC-1995. 
XX 

PF 02-JUN-1995; 95WO-US07113 . 
XX 

PR 02-JUN-1994; 94US-0253155 . 
XX 

PA (MITO-) MITOTIX INC. 
XX 

PI Draetta G, Gyuris J; 
XX 

DR WPI; 1996-040227/04. 

DR N-PSDB; AAT12173 . 
XX 

PT Cyclin- dependent kinase -4 binding protein - used in the isolation of 

PT (ant) agonists of cell cycle regulation. 

XX 

PS Claim 1; Page 81-82; 115pp; English. 
XX 

CC AAR90533-R90556 are cyclin dependent kinase 4 {CDK4) binding proteins. 

CC encoded by clones of the plasmid pJG4-5-CDKBP . CDK4 binding proteins 

CC (CDK4-BP) may be used in an assay for screening test compounds as 

CC inhibitors of CDK/CDK4-BP interaction. The complexes formed by CDK4 

CC and D-type cyclins are strongly implicated in the control of the early 

CC Gl phase of the cell cycle and are strong candidates for controlling 

CC and/or preventing tumourigenesis and the onset of cancer. Nucleic acids 

CC encoding CDK4-BP or fragments of these may be used as probes/primers 

CC to diagnose the presence or absence of genetic lesions in a gene 

CC encoding 1 of the 24 CDK4-BP, and hence to diagnose the risk for a 

CC subject of developing a cell -proliferation associated disorder (e.g. 

CC cancer) . 
XX 

SQ Sequence 278 AA; 



Query Match 25.7%; Score 1276.5; DB 17; Length 278; 

Best Local Similarity 92.0%; Pred. No. 7.6e-83; 

Matches 254; Conservative 4; Mismatches 13; Indels 5; Gaps 2; 

Qy 575 KSKQEKEKSKKKKGGKTEQDGYQKPTNKHFTQSPKKSVADLLG-SFEGKRRLLLITAPKA 633 

HIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIII= I lllllllll 

Db 2 eskqekekskkkkggkteqdgyqkptnkhf tqspkevsgrpagvlwkaneglllitapka 61 

Qy 634 ENNMYVQQRDEYLESFCKMATRKISVITIFGPVNNSTMKIDHFQLDNEKPMRWDDEDLV 693 

I llllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 62 ee qqrdeylesf ckmatrkisvitifgpvnnstmkidhf qldnekpmrwddedlv 117 

Qy 694 DQRLISELRKEYGMTYNDFFMVLTDVDLRVKQYYEVPITMKSVFDLIDTFQSRIKDMEKQ 753 

lllllllllllllllllllllllllllllllllllllllllll IIIIIIIIIIIIIIM 

Db 118 dqrliselrkeygmtyndf fmvltdvdlrvkqyyevpitmksvldlidtfqsrikdmekq 177 
Qy 754 KKEGIVCKEDKKQSLENFLSRFRWRRRLLVISAPNDEDWAYSQQLSALSGQACNFGLRHI 813 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIMIMIIIIIIIIIIIMIIIIIIIIIIIII 

Db 178 kkegivckedkkqslenflsrf rwrrrllvisapndedwaysqqlsalsgqacnfglrhi 237 
Qy 814 TILKLLGVGEEVGGVLELFPINGSSWEREDVPAHL 849 

IIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIM 

Db 238 tilkllgvgeevggvlelfpingssweredvpahl 273 



Issued : 
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No. Score Match Length DB ID Description 
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RESULT 1 
US-08-726-525-4 

; Sequence 4, Application US/08726525 
; Patent No. 5789181 
; GENERAL INFORMATION: 

APPLICANT: Lin, Lih-Ling 
; APPLICANT: Graham, James 

TITLE OF INVENTION: NOVEL INTERLEUKIN-1 RECEPTOR 

TITLE OF INVENTION: INTRACELLULAR LIGAND PROTEINS AND INHIBITORS OF LIGAND 
TITLE OF INVENTION: BINDING 
NUMBER OF SEQUENCES: 7 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LEGAL AFFAIRS, GENETICS INSTITUTE, INC. 

STREET: 87 CambridgePark Drive 

CITY: Cambridge 

STATE : MA 

COUNTRY : USA 

ZIP: 02140 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC -DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/726,525 

FILING DATE: 07-OCT-1996 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/487,942 

FILING DATE: 07-JUN-1995 
ATTORNEY/AGENT INFORMATION: 

NAME: Brown, Scott A. 

REGISTRATION NUMBER: 32,724 

REFERENCE/DOCKET NUMBER: GI5258 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617) 498-8224 

TELEFAX: (617) 876-5851 
; INFORMATION FOR SEQ ID NO : 4 : 
SEQUENCE CHARACTERISTICS: 



LENGTH: 320 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-726-525-4 



Query Match 27.1%; Score 134 3.5; DB 1; Length 320; 

Best Local Similarity 81.4%; Pred. No. 3.1e-90; 

267; Conservative 0; Mismatches 0; Indels 61; Gaps 

KKGGKTEQDGYQKPTNKHFTQSPKKSVADLLGSFEGKRRLLLITAPKAENNMYVQQRDEY 645 

MIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIII 
KKGGKTEQDGYQKPTNKHFTQSPKKSVADLLGSFEGKRRLLLITAPKAENNMYVQQRDEY 60 

LESFCKMATRKISVITIFGPVNNSTMKIDHFQLDNEKPMRWDDEDLVDQRLISELRKEY 705 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 
LESFCKMATRKISVITIFGPVNNSTMKIDHFQLDNEKPMRWDDEDLVDQRLISELRKEY 120 

GMTYNDFFMVLTDVDLRVKQYYEVPITMKSVFDLIDTFQSRIKDMEKQKKEGIVCKEDKK 765 

MIMIIMIIIIIIMMIIIIIIIIIIIIIMIIIMIIIIIIIIIMIIIIII 

GMTYNDFFMVLTDVDLRVKQYYEVPITMKSVFDLIDTFQSRIKDMEKQKKEGIVCK 176 



III 
-EEV 179 
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Qy 


766 


Db 
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Qy 


826 


Db 


180 


Qy 


886 


Db 


240 



GGVLELFPINGSSWEREDVPAHLVKDIRNYFQVSPEYFSMLLVGKDGNVKSWYPSPMWS 885 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
GGVLELFPINGSSWEREDVPAHLVKDIRNYFQVSPEYFSMLLVGKDGNVKSWYPSPMWS 239 

MVIVYDLIDSMQLRRQEMAIQQSLGMRC 913 

llllllllllllllllllllllllllll 
MVIVYDLIDSMQLRRQEMAIQQSLGMRC 267 



RESULT 5 
US-08-253-155A-36 

; Sequence 36, Application US/08253155A 
; Patent No. 5691147 
; GENERAL INFORMATION: 

APPLICANT: Gyuris, Jeno 

APPLICANT: Draetta, Giulio 

TITLE OF INVENTION: CDK4 Binding Proteins 

NUMBER OF SEQUENCES: 95 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 

STREET: 60 State Street 

CITY: Boston 

STATE : MA 

COUNTRY: USA 

ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: ASCII (text) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/253 , 155A 

FILING DATE: 02-JUN-1994 

CLASSIFICATION: 435 
ATTORNEY /AGENT INFORMATION: 

NAME: Vincent, Matthew P. 

REGISTRATION NUMBER: 36,709 

REFERENCE/DOCKET NUMBER: MI I -02 8 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617) 227-7400 

TELEFAX: (617) 227-5941 
; INFORMATION FOR SEQ ID NO: 36: 



SEQUENCE CHARACTERISTICS: 
LENGTH: 358 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
US-08-253-155A-36 



Query Match 13.7%; Score 682; DB 1; Length 358; 

Best Local Similarity 66.7%; Pred. No. 5.1e-42; 

Matches 144; Conservative 12; Mismatches 32; Indels 28; Gaps 

Qy 640 QQRDEYLESFCKMATRKISVITIFGPVNNSTMKIDHFQLDNEKPMRWDDEDLVDQRLIS 699 

IIMIIIIIIII Mlllllllllllllllllllllllllll lllllll 

Db 6 QQRDEYLESFCK FGPVNNSTMKIDHFQLDNEKPMRWDDEDWVDQRLIS 54 

Qy 700 ELRKEYGMTYND FFMVLTDVDLRVKQYYEVPITMKSVFDLIDTFQSRIKDMEKQKK 755 

llllllhll I I : : :: : : : I I I I I I I I I I M I I I I I 

Db 55 ELRKEYGLTYTDSSWCXQMWIXESSNTMRYQXQXSLCL HLIDTFQSRIKDMEKQKK 110 

Qy 756 EGIVCKEDKKQSLENFLSRFRWRRRLLVISAPNDEDWAYSQQLSALSGQACNFGLRHITI 815 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIM lllllll 
Db 111 EGIVCKEDKKQSLENFLSRFRWRRRLLVISAPNDEDWAYSQQLSALSGQACNLGLRHITI 170 

Qy 816 LKLLGVGEEVG GVLELFPINGSSWER 842 

III : ::| III =11 

Db 171 LKLXALERKLGESXNCSQLMGALLLSEKTXQPICER 206 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


212 .5 


4 


3 


1507 


2 


B47328 


natural killer eel 


2 


211 


4 


2 


2022 


2 


T48818 


glucan 1,4-alpha-g 


3 


211 


4 


2 


5327 


2 


T13564 


microtubule- as soci 


4 


210 


4 


2 


1263 


2 


T15496 


hypothetical prote 


5 


209.5 


4 


2 


1618 


2 


S21424 


nestin - human 


6 


209 


4 


2 


891 


2 


G84693 


probable proline-r 


7 


207.5 


4 


2 


3488 


2 


T34418 


hypothetical prote 


8 


203 


4 


1 


706 


2 


A45990 


junctional sarcopl 


9 


202 


4 


1 


6642 


2 


T29757 


protein UNC-89 - C 


10 


201.5 


4 


1 


1560 


2 


T42727 


proliferation pote 


11 


200 


4 


0 


611 


2 


T22456 


hypothetical prote 


12 


199.5 


4 


0 


892 


2 


T50985 


related to transcr 


13 


198 


4 


0 


990 


2 


151618 


nucleolar phosphop 


14 


197.5 


4 


0 


2526 


2 


T20531 


hypothetical prote 


15 


197.5 


4 


0 


2722 


2 


T20532 


hypothetical prote 


16 


197.5 


4 


0 


2738 


2 


E88320 


protein F07A11.6 [ 


17 


196.5 


4 


0 


476 


2 


T27051 


hypothetical prote 


18 


196 


3 


9 


946 


2 


F88196 


protein ZK1127.9 [ 


19 


196 


3 


9 


1015 


2 


JC6552 


DNA topoisomerase 


20 


196 


3 


9 


1087 


1 


QFMSH 


neurofilament trip 


21 


193 


3 


9 


2441 


2 


S39161 


CREB -binding prote 



SUMMARIES 

% 

Result Query 



No. 


Score 


Match 


Length DB 


ID 


Description 


1 


2342 


47.2 


507 


11 


Q9CT39 


Q9ct39 mus musculu 


2 


2337 


47.1 


867 


13 


Q90X49 


Q90x4 9 brachydanio 


3 


1466 


29.5 


385 


11 


Q9JKW4 


Q9jkw4 rattus norv 


4 


1148.5 


23.1 


358 


11 


Q9CRM1 


Q9crml mus musculu 


5 


1065 


21,4 


209 


11 


Q9D6Z4 


Q9d6z4 mus musculu 


6 


255.5 


5.1 


897 


11 


070495 


0704 95 mus musculu 



7 


228.5 


4 


.6 


820 


4 


060585 


060585 homo sapien 


8 


211 


4 . 


.2 


2090 


3 


Q9P6X4 


Q9p6x4 neurospora 


9 


211 


4 


.2 


5327 


5 


076891 


076891 drosophila 


10 


210 


4 


.2 


1422 


6 


Q95KU4 


Q95ku4 canis famil 


11 


209 


4 


.2 


891 


10 


Q9ZW08 


Q9zw08 arabidopsis 


12 


209 


4 . 


.2 


894 


10 


Q9FYB2 


Q9fyb2 arabidopsis 


13 


208 


4 . 


.2 


1289 


5 


Q9VWC0 


Q9vwc0 drosophila 


14 


205.5 


4 . 


.1 


3484 


5 


P91257 


P91257 caenorhabdi 


15 


204 .5 


4 . 


.1 


2951 


5 


Q9W3Z0 


Q9w3z0 drosophila 



SEQ ID NO: 26 



SUMMARIES 

% 

Result Query 





No. 


Score 


Match Length 


DB 


ID 


Description 




1 


2892, 


.6 


70 


.3 


2927 


6 


AX013703 


AX013703 Sequence 




2 


2390 


.4 


58 


.1 


215719 


9 


AC048334 


AC048334 Homo sapi 




3 


2344 


.2 


56 


.9 


4467 


6 


AX073687 


AX073687 Sequence 




4 


2250, 


.6 


54 


. 7 


3719 


10 


AF223677 


AF223677 Rattus no 


c 


5 


1830, 


.6 


44 


.5 


168079 


2 


AC024709 


AC024 709 Homo sapi 




6 


1682. 


.2 


40 


.9 


1740 


6 


AX073686 


AX073686 Sequence 


c 


7 


998, 


.2 


24 


.2 


1061 


6 


AX188264 


AX188264 Sequence 




8 


890 


21 


.6 


1075 


6 


176208 


176208 Sequence 12 




9 


863 


21 


.0 


1088 


6 


AR020615 


AR02061S Sequence 




10 


863 


21, 


.0 


1088 


6 


AR044086 


AR044086 Sequence 




11 


863 


21 


.0 


1088 


6 


AR085161 


AR085161 Sequence 


c 


12 


666, 


.4 


16. 


.2 


175135 


2 


AC079211 


AC079211 Homo sapi 


c 


13 


666, 


.4 


16 


.2 


185101 


2 


AC079202 


AGO 7 92 02 Homo sapi 




14 


618, 


.8 


15. 


.0 


780 


6 


AX053275 


AX053275 Sequence 




15 


263 


6 


.4 


121086 


5 


AL590146 


AL590146 Zebrafish 


c 


16 


227, 


.6 


5 . 


.5 


250 


11 


G11501 


G11501 SHGC-14633 




17 


166. 


.8 


4 . 


.1 


231 


6 


AX186301 


AX186301 Sequence 




18 


156. 


.8 


3 


.8 


221 


6 


AX184638 


AX184638 Sequence 




19 


153. 


.4 


3 . 


.7 


168079 


2 


AC024709 


AC024 709 Homo sapi 




20 


151. 


.4 


3 , 


.7 


182 


6 


AX187502 


AX187502 Sequence 




21 


140. 


.2 


3 , 


.4 


185 


6 


AX182067 


AX182067 Sequence 


c 


22 


113 . 


.2 


2 , 


. 7 


790 


6 


AX053276 


AX053276 Sequence 


c 


23 


94 . 


.6 


2 , 


.3 


7218 


6 


166494 


166494 Sequence 14 


c 


24 


89. 


,4 


2 , 


.2 


306 


6 


176229 


176229 Sequence 57 


c 


25 


86. 


.4 


2 . 


.1 


118632 


9 


AL133380 


AL133380 Human DNA 



8 



RESULT 
176208 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 

BASE COUNT 
ORIGIN 



176208 1075 bp DNA linear 

Sequence 12 from patent US 5691147. 

176208 

176208.1 GI:3012362 

Unknown . 

Unknown . 

Unclassified. 

1 (bases 1 to 1075) 

Draetta.G. and Gyuris,J. 

CDK4 binding assay 

Patent: US 5691147-A 12 25-NOV-1997; 
Location/Qualifiers 
1. .1075 

/organisms "unknown" 
291 a 234 c 269 g 280 t 1 others 



PAT 03-APR-1998 



Query Match 21.6%; Score 890; DB 6; Length 1075; 

Best Local Similarity 96.2%; Pred. No. 2.4e-193; 

Matches 975; Conservative 1; Mismatches 31; Indels 7; Gaps 6; 



Qy 


2918 


Db 


44 


Qy 


2978 


Db 


104 


Qy 


3038 


Db 


164 


Qy 


3098 


Db 


223 


Qy 


3157 


Db 


283 


Qy 


3217 


Db 


343 


Qy 


3277 


Db 


403 


Qy 


3337 


Db 


463 


Qy 


3397 


Db 


522 


Qy 


3457 


Db 


582 


Qy 


3517 


Db 


641 


Qy 


3577 


Db 


701 


Qy 


3637 


Db 


761 


Qy 


3697 


Db 


821 


Qy 


3757 


Db 


881 


Qy 


3817 


Db 


941 


Qy 


3876 


Db 


999 



tcaccatcttcggccctgtcaacaacagcaccatgaaaatcgaccactttcagctagata 2977 

II II IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TCTGCAAGTTCGGCCCTGTCAACAACAGCACCATGAAAATCGACCACTTTCAGCTAGATA 103 
atgagaagcccatgcgagtggtggatgatgaagacttggtagaccagcgtctcatcagcg 3037 

llllllllllllllllllllllllllllllllllll IIIIIIIIIIIIIIIIIIIIIM 

ATGAGAAGCCCATGCGAGTGGTGGATGATGAAGACTGGGTAGACCAGCGTCTCATCAGCG 163 
agctgaggaaagagtacggaatgacctacaatgacttcttcatggtgctaacagatgtgg 3097 

IIIIIIIMI lllllllll lllllllll III lllllllllllllllllllllllll 

AGCTGAGGAAGGAGTACGGATTGACCTACACTGA-TTCTTCATGGTGCTAACAGATGTGG 222 
atctgagagtcaagcaatactatgaggtaccaataacaatgaagtctgtgtttg-atctg 3156 

IIIIIIIIIIIIIIIIIIIIIIIMIIIIIMIIIIMIIIIIIIIIIIIMII Mill 

ATCTGAGAGTCAAGCAATACTATGAGGTACCAATAACAATGAAGTCTGTGTTTGCATCTG 282 
atcgatactttccagtcccgaatcaaagatatggagaagcagaagaaggagggcattgtt 3216 

IIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

ATCGATACTTTCCAGTCCCGAATCAAAGATATGGAGAAGCAGAAGAAGGAGGGCATTGTT 342 

tgcaaagaggacaaaaagcagtccctggagaacttcctatccaggttccggtggaggagg 3276 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TGCAAAGAGGACAAAAAGCAGTCCCTGGAGAACTTCCTATCCAGGTTCCGGTGGAGGAGG 4 02 

aggttgctggtgatctctgctcctaacgatgaagactgggcctattcacagcagctctct 3336 

IIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGGTTGCTGGTGATCTCTGCTCCTAACGATGAAGACTGGGCCTATTCACAGCAGCTCTCT 462 
gccctcagtggtcaggcgtgcaattttggtctgcgccacataaccattctgaagctttta 3396 

IIMIIIIIIIIIIIIIIIMIIIII IIIIIIIIIMIIIIIIIIIIIIIIIII MM 

GCCCTCAGTGGTCAGGCGTGCAATTTGGGTCTGCGCCACATAACCATTCTGAAGC - TTTA 521 



IMIIMMMIMIIMIIIMMI II II I M M M II I II I M I M 1 1 M II I II M 

GGCGTTGGAGAGGAAGTTGGGGGAGTCTTAGAACTGTTCCCAATTAATGGGAGCTCTGTT 581 



IIIMIIIIMIIIIIIMMMMMM 1 1 II II I II M I II II I II M M 1 1 M 1 1 1 

GTTGAGCGAGAAGACGTWCCAGCCCATTT - GTGAAAGACATTCGTAACTATTTTCAAGTG 64 0 
agcccggagtacttctccatgcttctagtcggaaaagacggaaatgtcaaatcctggtat 3576 

IIMMMMMMMMMIMIMMMMMMMMMMMIIMMMMMM 

AGCCCGGAGTACTTCTCCATGCTTCTAGTCGGAAAAGACGGAAATGTCAAATCCTGGTAT 700 
ccttccccaatgtggtccatggtgattgtgtacgatttaattgattcgatgcaacttcgg 3636 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

CCTTCCCCAATGTGGTCCATGGTGATTGTGTACGATTTAATTGATTCGATGCAACTTCGG 760 
agacaggaaatggcgattcagcagtcactggggatgcgctgcccagaagatgagtatgca 3696 

IIIIIIIIMIIMIMIIIIIIMIMIIIIMMIIIIIIIIIIIIIIIIIIIIMII 

AGACAGGAAATGGCGATTCAGCAGTCACTGGGGATGCGCTGCCCAGAAGATGAGTATGCA 820 
ggctatggttaccatagttaccaccaaggataccaggatggttaccaggatgactaccgt 3756 

MMIMMIMM II IIIIIIMIIIIIMIIIIIIIIIMIIIIIIII II IIMIIII 

GGCTATGGTTACCATAGTTACCACCAAGGATACCAGGATGGTTACCAGGATGACTACCGT 880 
catcatgagagttatcaccatggatacccttactgagcagaaatatgtaaccttagactc 3816 

lillllllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

CATCATGAGAGTTATCACCATGGATACCCTTACTGAGCAGAAATATGTAACCTTAGACTC 94 0 
agccagtttcctctgcagctgctaaaactacatgtggccagctccattcttccacact -g 3875 

I I IIIIMIIIIIIIIMI IIIMI IIIIIIIMIIIIIIIIIIMIII I 

A - - CCATTTCCTCTGCAGCTGCTGGAACTACGATTGGCCAGCTCCATTCTTCCACACTGG 998 
cgtactacatttcctgcctttttctttcagtgtttttctaagactaaataaata 3929 

lllllllllllllll llllllllllll I MM I I MMM 

GGTACTACATTTCCTGGCTTTTTCTTTCAAGGGTTTTTCTTTAGGACTTAAATA 1052 



SUMMARIES 

% 

Result Query 



No. 


Score 


Match Length DB 


ID 


Description 


1 


4117 


100 


. 0 


4117 


20 


AAZ09839 


Human membrane spa 


2 


2892.6 


70 


.3 


2927 


20 


AAZ42150 


Human normal bladd 


3 


2344 .2 


56 


. 9 


4467 


22 


AAC85085 


Atherosclerosis -as 


4 


1817.2 


44 


. 1 


1847 


22 


AAH19187 


Human secreted pro 


5 


1757.2 


42 


.7 


1815 


22 


AAK94552 


Human full-length 


6 


1682.2 


40 


.9 


1740 


22 


AAC85084 


Atherosclerosis -as 


7 


1656.8 


40 


.2 


1792 


22 


AAF72810 


Secreted protein g 


8 


1572.2 


38 


.2 


1673 


22 


AAF72835 


Secreted protein g 


9 


1535 


37 


.3 


1561 


21 


AAC77711 


Human cancer assoc 


C 10 


1512 .2 


36 


.7 


1578 


22 


AAL36188 


Human musculoskele 


11 


1399.6 


34 , 


.0 


1438 


22 


AAH19228 


Human secreted pro 


12 


1183 .8 


28 


.8 


1285 


17 


AAT12173 


Partial pJG4-5-CDK 


13 


1183.8 


28 


.8 


1285 


17 


AAT12174 


Partial pJG4-5-CDK 


C 14 


998.2 


24 , 


.2 


1061 


22 


AAL26622 


Human breast cance 


c 15 


998.2 


24 


.2 


1061 


22 


AAL26732 


Human breast cance 


c 16 


998.2 


24 . 


.2 


1061 


22 


AAH72682 


Human cervical can 



RESULT 12 
AAT12173 

ID AAT12173 standard; cDNA; 1285 BP. 
XX 

AC AAT12173; 
XX 

DT 08-AUG-1996 (first entry) 
XX 

DE Partial pJG4-5-CDK-BP cDNA clone #125 encoding CDK4 binding protein. 
XX 

KW Cell cycle; regulation; Gl phase; proliferation; tumourigenesis ; 
KW cyclin dependent kinase; differentiation; CDK4 inhibitor; agonist; 
KW antagonist; ss. 
XX 

OS Synthetic, 
XX 

FH Key Location/Qualifiers 
FT CDS 8. .845 

FT /*tag= a 

FT /note= "does not contain start codon" 

XX 

PN W09533819-A2. 
XX 

PD 14-DEC-1995. 
XX 

PF 02-JUN-1995; 95WO-US07113 . 
XX 

PR 02-JUN-1994; 94US-0253155 . 
XX 

PA (MITO-) MITOTIX INC. 
XX 

PI Draetta G, Gyuris J; 
XX 

DR WPI; 1996-040227/04. 
DR P-PSDB; AAR90544 . 
XX 

PT Cyclin -dependent kinase-4 binding protein - used in the isolation of 

PT (ant) agonists of cell cycle regulation. 

XX 

PS Claim 30; Page 60-61; 115pp; English. 
XX 

CC AAT12162-T12184 are cDNA clones of the plasmid pJG4-5-CDK-BP, or 

CC fragments of the clones. The clones encode cyclin dependent kinase 4 

CC (CDK4) binding proteins (CDK4-BP) , which may be used in an assay for 

CC screening test compounds as inhibitors of CDK/CDK4-BP interaction. 

CC The complexes formed by CDK4 and D-type cyclins are strongly implicated 

CC in the control of the early Gl phase of the cell cycle and are strong 



CC candidates for controlling and/ or preventing tumour igenes is and the 

CC onset of cancer. Nucleic acids encoding CDK4-BP or fragments of these 

CC may be used as probes/primers to diagnose the presence or cibsence of 

CC genetic lesions in a gene encoding 1 of the 24 CDK4-BP, and hence to 

CC diagnose the risk for a subject of developing a cell -proliferation 

CC associated disorder (e.g. cancer). 
XX 

SQ Sequence 1285 BP; 376 A; 285 C; 319 G; 305 T; 0 other; 



Query Match 28.8%; Score 1183.8; DB 17; Length 1285; 

Best Local Similarity 97.1%; Pred. No. 2.2e-288; 

Matches 1252; Conservative 0; Mismatches 22; Indels 15; Gaps 4; 
Qy 2658 gaaaaagagcaagcaagagaaagagaagagcaagaagaaaaaaggaggtaaaacagaaca 2717 

I I iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 8 gcacgagagcaagcaagagaaagagaagagcaagaagaaaaaaggaggtaaaacagaaca 67 

Qy 2718 ggatggctatcagaaacccaccaacaaacacttcacgcagagtcccaa-gaagtcagtgg 2776 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii iiiiiiiiiii 

Db 68 ggatggctatcagaaacccaccaacaaacacttcacgcagagtcccaaggaagtcagtgg 127 
Qy 2777 ccgacctgctggggtccttt-gaaggcaaacgaa-gactccttctgatcactgctcccaa 2834 

Illlllilllllllllllll iiiiiiiiiiiii lllllllllllllllllllllllll 

Db 128 ccgacctgctggggtcctttggaaggcaaacgaaggactccttctgatcactgctcccaa 187 
Qy 2835 ggctgagaacaatatgtatgtgcaacaacgtgatgaatatctggaaagtttctgcaagat 2894 

Illllll I IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 

Db 188 ggctgag gaacaacaacgtgatgaatatctggaaagtttctgcaagat 235 

Qy 2895 ggctaccaggaaaatctctgtgatcaccatcttcggccctgtcaacaacagcaccatgaa 2954 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 236 ggctaccaggaaaatctctgtgatcaccatcttcggccctgtcaacaacagcaccatgaa 295 
Qy 2955 aatcgaccactttcagctagataatgagaagcccatgcgagtggtggatgatgaagactt 3014 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 296 aatcgaccactttcagctagataatgagaagcccatgcgagtggtggatgatgaagactt 355 
Qy 3015 ggtagaccagcgtctcatcagcgagctgaggaaagagtacggaatgacctacaatgactt 3074 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 356 ggtagaccagcgtctcatcagcgagctgaggaaagagtacggaatgacctacaatgactt 415 
Qy 3075 cttcatggtgctaacagatgtggatctgagagtcaagcaatactatgaggtaccaataac 3134 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 416 cttcatggtgctaacagatgtggatctgagagtcaagcaatactatgaggtaccaataac 4 75 
Qy 3135 aatgaagtctgtgtttgatctgatcgatactttccagtcccgaatcaaagatatggagaa 3194 

iiiiiiiiiiiiiii iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 476 aatgaagtctgtgttggatctgatcgatactttccagtcccgaatcaaagatatggagaa 535 
Qy 3195 gcagaagaaggagggcattgtttgcaaagaggacaaaaagcagtccctggagaacttcct 3254 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 536 gcagaagaaggagggcattgtttgcaaagaggacaaaaagcagtccctggagaacttcct 595 
Qy 3255 atccaggttccggtggaggaggaggttgctggtgatctctgctcctaacgatgaagactg 3314 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 596 atccaggttccggtggaggaggaggttgctggtgatctctgctcctaacgatgaagactg 655 

Qy 3315 ggcctattcacagcagctctctgccctcagtggtcaggcgtgcaattttggtctgcgcoa 3374 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 656 ggcctattcacagcagctctctgccctcagtggtcaggcgtgcaattttggtctgcgcca 715 

Qy 3375 cataaccattctgaagcttttaggcgttggagaggaagttgggggagtgttagaactgtt 3434 

IIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIII 

Db 716 cataaccattctgaagcttttaggcgttggagaggaagttgggggagtgttagaactgtt 775 
Qy 3435 cccaattaatgggagctctgttgttgagcgagaagacgtaccagcccatttggtgaaaga 34 94 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii II 

Db 776 cccaattaatgggagctctgttgttgagcgagaagacgtaccagcccatttgggtgaaag 835 
Qy 3495 cattcgtaactattttcaagtgagcccggagtacttctccatgcttctagtcggaaaaga 3554 



Db 


836 


Qy 


3555 


Db 


896 


Qy 


3615 


Db 


956 


Qy 


3675 


Db 


1016 


Qy 


3735 


Db 


1076 


Qy 


3795 


Db 


1136 


Qy 


3855 


Db 


1196 


Qy 


3915 


Db 


1256 



II I IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

acatccgtaactatttcaagtgagcccggagtacttctccatgcttctagtcggaaaaga 



895 



3614 



cggaaatgtcaaatcctggtatccttccccaatgtggtccatggtgattgtgtacgattt 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

cggaaatgtcaaatcctggtatccttccccaatgtggtccatggtgattgtgtacgattt 955 
aattgattcgatgcaacttcggagacaggaaatggcgattcagcagtcactggggatgcg 3674 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

aattgattcgatgcaacttcggagacaggaaatggcgattcagcagtcactggggatgcg 1015 
ctgcccagaagatgagtatgcaggctatggttaccatagttaccaccaaggataccagga 3734 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

ctgcccagaagatgagtatgcaggctatggttaccatagttaccaccaaggataccagga 1075 



tggttaccaggatgactaccgtcatcatgagagttatcaccatggatacccttactgagc 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
tggttaccaggatgactaccgtcatcatgagagttatcaccatggatacccttactgagc 



3794 



1135 



agaaatatgtaaccttagactcagccagtttcctctgcagctgctaaaactacatgtggc 3854 

III lllllllllllllllll Mill IIIIIIIIIIIIIIIIMIIIIIIIIIIIII III! 

agaaatatgtaaccttagactcagccagtttcctctgcagctgctaaaactacatgtggc 1195 
cagctccattcttccacactgcgtactacatttcctgcctttttctttcagtgtttttct 3914 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

cagctccattcttccacactgcgtactacatttcctgcctttttctttcagtgtttttct 1255 
aagactaaataaatagcaaactttcacct 3943 

lllllllllllllllll IIIIIIIIMI 

aagactaaataaatagccaactttcacct 1284 



AAT12174; 

08-AUG-1996 (first entry) 

Partial pJG4 -5-CDK-BP cDNA clone #127 encoding CDK4 binding protein. 

Cell cycle; regulation; Gl phase; proliferation; tumour igenesis; 
eye 1 in dependent kinase; differentiation; CDK4 inhibitor; agonist; 
antagonist; ss. 



Synthetic . 

Key 
CDS 



RESULT 13 
AAT12174 

ID AAT12174 standard; cDNA; 1285 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 

PI 

XX 
DR 
DR 
XX 
PT 
PT 



Location/Qualifiers 
16. .894 
/*tag= a 

/note= "does not contain start codon" 



95WO-US07113. 
94US-0253155. 



W09533819-A2 

14-DEC-1995. 

02-JUN-1995; 

02-JUN-1994; 

{MITO-) MITOTIX INC. 

Draetta G, Gyuris J; 

WPI; 1996-040227/04. 
P-PSDB; AAR90545. 

Cyclin-dependent kinase-4 binding protein - used in the isolation of 
(ant) agonists of cell cycle regulation. 



XX 

PS Claim 30; Page 61-62; llSpp; English. 
XX 

CC AAT12162-T12184 are cDNA clones of the plasmid pJG4-5-CDK-BP, or 

CC fragments of the clones. The clones encode cyclin dependent kinase 4 

CC (CDK4) binding proteins {CDK4-BP) , which may be used in an assay for 

CC screening test compounds as inhibitors of CDK/CDK4-BP interaction. 

CC The complexes formed by CDK4 and D-type cyclins are strongly implicated 

CC in the control of the early Gl phase of the cell cycle and are strong 

CC candidates for controlling and/or preventing tumour igenesis and the 

CC onset of cancer. Nucleic acids encoding CDK4-BP or fragments of these 

CC may be used as probes/primers to diagnose the presence or absence of 

CC genetic lesions in a gene encoding 1 of the 24 CDK4-BP, and hence to 

CC diagnose the risk for a subject of developing a cell -proliferation 

CC associated disorder (e.g. cancer) . 

XX 

SQ Sequence 1285 BP; 376 A; 285 C; 319 G; 305 T; 0 Other; 



Query Match 28.8%; Score 1183.8; DB 17; Length 1285; 

Best Local Similarity 97.1%; Pred. No. 2.2e-288; 

Matches 1252; Conservative 0; Mismatches 22; Indels 15; Gaps 
Qy 2658 gaaaaagagcaagcaagagaaagagaagagcaagaagaaaaaaggaggtaaaacagaaca 2717 

I I IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIII 

Db 8 gcacgagagcaagcaagagaaagagaagagcaagaagaaaaaaggaggtaaaacagaaca 67 

Qy 2718 ggatggctatcagaaacccaccaacaaacacttcacgcagagtcccaa-gaagtcagtgg 2776 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIII lllllllllll 

Db 68 ggatggctatcagaaacccaccaacaaacacttcacgcagagtcccaaggaagtcagtgg 127 
Qy 2777 ccgacctgctggggtccttt-gaaggcaaacgaa-gactccttctgatcactgctcccaa 2834 

Illlllllllllllllllll lllllllllllll lllllllllllllllllllllllll 

Db 12 8 ccgacctgctggggtcctttggaaggcaaacgaaggactccttctgatcactgctcccaa 187 
Qy 2835 ggctgagaacaatatgtatgtgcaacaacgtgatgaatatctggaaagtttctgcaagat 2894 

Illllll I iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 188 ggctgag gaacaacaacgtgatgaatatctggaaagtttctgcaagat 235 

Qy 2895 ggctaccaggaaaatctctgtgatcaccatcttcggccctgtcaacaacagcaccatgaa 2 954 

IIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 236 ggctaccaggaaaatctctgtgatcaccatcttcggccctgtcaacaacagcaccatgaa 295 
Qy 2955 aatcgaccactttcagctagataatgagaagcccatgcgagtggtggatgatgaagactt 3014 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 296 aatcgaccactttcagctagataatgagaagcccatgcgagtggtggatgatgaagactt 355 
Qy 3015 ggtagaccagcgtctcatcagcgagctgaggaaagagtacggaatgacctacaatgactt 3074 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

Db 356 ggtagaccagcgtctcatcagcgagctgaggaaagagtacggaatgacctacaatgactt 415 
Qy 3075 cttcatggtgctaacagatgtggatctgagagtcaagcaatactatgaggtaccaataac 3134 

IIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 416 cttcatggtgctaacagatgtggatctgagagtcaagcaatactatgaggtaccaataac 4 75 

Qy 3135 aatgaagtctgtgtttgatctgatcgatactttccagtcccgaatcaaagatatggagaa 3194 

lllllllllllllll llllllllllllllllllllllllllllllllllllllllllll 
Db 476 aatgaagtctgtgttggatctgatcgatactttccagtcccgaatcaaagatatggagaa 535 

Qy 3195 gcagaagaaggagggcattgtttgcaaagaggacaaaaagcagtccctggagaacttcct 32 54 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 536 gcagaagaaggagggcattgtttgcaaagaggacaaaaagcagtccctggagaacttcct 595 
Qy 3255 atccaggttccggtggaggaggaggttgctggtgatctctgctcctaacgatgaagactg 3314 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 596 atccaggttccggtggaggaggaggttgctggtgatctctgctcctaacgatgaagactg 655 



Qy 

Db 



3315 ggcctattcacagcagctctctgccctcagtggtcaggcgtgcaattttggtctgcgcca 3374 

IIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

656 ggcctattcacagcagctctctgccctcagtggtcaggcgtgcaattttggtctgcgcca 715 



Qy 


3375 


Db 


716 


Qy 


3435 


Db 


776 


Qy 


3495 


Db 


836 


Qy 


3555 


Db 


896 


Qy 


3615 


Db 


956 


Qy 


3675 


Db 


1016 


Qy 


3735 


Db 


1076 


Qy 


3795 


Db 


1136 


Qy 


3855 


Db 


1196 


Qy 


3915 


Db 


1256 



cataaccattctgaagcttttaggcgttggagaggaagttgggggagtgttagaactgtt 3434 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

cataaccattctgaagcttttaggcgttggagaggaagttgggggagtgttagaactgtt 775 
cccaattaatgggagctctgttgttgagcgagaagacgtaccagcccatttggtgaaaga 34 94 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii II 

cccaattaatgggagctctgttgttgagcgagaagacgtaccagcccatttgggtgaaag 835 
cattcgtaactattttcaagtgagcccggagtacttctccatgcttctagtcggaaaaga 3554 

II I iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

acatccgtaactatttcaagtgagcccggagtacttctccatgcttctagtcggaaaaga 895 
cggaaatgtcaaatcctggtatccttccccaatgtggtccatggtgattgtgtacgattt 3614 

IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIIIMII 

cggaaatgtcaaatcctggtatccttccccaatgtggtccatggtgattgtgtacgattt 955 
aattgattcgatgcaacttcggagacaggaaatggcgattcagcagtcactggggatgcg 3674 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

aattgattcgatgcaacttcggagacaggaaatggcgattcagcagtcactggggatgcg 1015 
ctgcccagaagatgagtatgcaggctatggttaccatagttaccaccaaggataccagga 3734 

IIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

ctgcccagaagatgagtatgcaggctatggttaccatagttaccaccaaggataccagga 1075 
tggttaccaggatgactaccgtcatcatgagagttatcaccatggatacccttactgagc 3794 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 

tggttaccaggatgactaccgtcatcatgagagttatcaccatggatacccttactgagc 1135 
agaaatatgtaaccttagactcagccagtttcctctgcagctgctaaaactacatgtggc 3854 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMMIIIIIIM 

agaaatatgtaaccttagactcagccagtttcctctgcagctgctaaaactacatgtggc 1195 
cagctccattcttccacactgcgtactacatttcctgcctttttctttcagtgtttttct 3914 

IIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

cagctccattcttccacactgcgtactacatttcctgcctttttctttcagtgtttttct 1255 
aagactaaataaatagcaaactttcacct 3943 

IMIIIIIIIIIIIIII IIIMIIIIII 

aagactaaataaatagccaactttcacct 1284 



Issued: 

SUMMARIES 

% 

Result Query- 
No. Score Match Length DB ID Description 



1 


890 


21 


.6 


1075 


1 


US- 


■08- 


253 


-155A-12 


Sequence 


12, Appl 


2 


863 


21 


.0 


1088 


1 


US- 


■08- 


•726 


-525-3 


Sequence 


3, Appli 


3 


863 


21 


.0 


1088 


1 


US- 


08- 


•487 


-942-3 


Sequence 


3 , Appli 


4 


863 


21 


.0 


1088 


2 


US- 


■08- 


726 


-036A-3 


Sequence 


3 , Appli 


5 


863 


21 


.0 


1088 


4 


US- 


09- 


■083 


-516-3 


Sequence 


3, Appli 


6 


94.6 


2 


.3 


7218 


1 


us- 


08- 


232 


-463-14 


Sequence 


14, Appl 


7 


89.4 


2 


.2 


306 


1 


US- 


08- 


253 


-155A-57 


Sequence 


57, Appl 


8 


69.4 


1, 


. 7 


16442 


3 


US- 


08- 


781 


-891-208 


Sequence 


208, App 


9 


68.2 


1 , 


.7 


2223 


1 


US- 


08- 


257 


-073-4 


Sequence 


4, Appli 


10 


61 


1, 


.5 


43795 


3 


us- 


08- 


742 


-185-101 


Sequence 


101, App 


11 


57 .8 


1 , 


.4 


240 


1 


us- 


08- 


628 


-417-6 


Sequence 


6, Appli 


12 


56.6 


1, 


.4 


2447 


2 


us- 


09- 


014 


-969-14 


Sequence 


14, Appl 


13 


55.6 


1, 


.4 


376 


2 


us- 


08- 


623 


-906A-18 


Sequence 


18, Appl 


14 


55.4 


1, 


.3 


696 


4 


us- 


09- 


461 


-697-193 


Sequence 


193, App 


15 


55.4 


1, 


.3 


699 


4 


us- 


09- 


461 


-697-191 


Sequence 


191, App 


16 


55.4 


1, 


.3 


717 


4 


us- 


09- 


461 


-697-189 


Sequence 


189, App 


17 


55 .4 


1, 


.3 


774 


4 


us- 


09- 


461 


-697-187 


Sequence 


187, App 


18 


55.4 


1, 


.3 


819 


4 


us- 


09- 


461 


-697-185 


Sequence 


185, App 


19 


55.4 


1, 


.3 


1669 


4 


us- 


09- 


461 


-697-184 


Sequence 


184, App 


20 


55.2 


1, 


.3 


72604 


4 


us- 


09- 


268 


-992-7 


Sequence 


7, Appli 



RESULT 1 
US-08-253-155A-12 

; .Sequence 12, Application US/08253155A 
; Patent No. 5691147 
; GENERAL INFORMATION: 

APPLICANT: Gyuris, Jeno 

APPLICANT: Draetta, Giulio 

TITLE OF INVENTION: CDK4 Binding Proteins 

NUMBER OF SEQUENCES: 95 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD 

STREET: 60 State Street 

CITY: Boston 

STATE : MA 

COUNTRY : USA 

ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: ASCII (text) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/253 , 155A 

FILING DATE: 02-JUN-1994 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Vincent, Matthew P. 

REGISTRATION NUMBER: 36,709 

REFERENCE/DOCKET NUMBER: MI I -02 8 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617) 227-7400 

TELEFAX: (617) 227-5941 
; INFORMATION FOR SEQ ID NO: 12: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1075 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
US-08-253-155A-12 



Query Match 21.6%; 
Best Local Similarity 96.2%; 
Matches 975 ; Conservative 



Score 890; DB 1; Length 1075; 
Pred. No. 2.1e-237; 
1; Mismatches 31; Indels 



Gaps 



Qy 2918 tcaccatcttcggccctgtcaacaacagcaccatgaaaatcgaccactttcagctagata 2977 

II II IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIII 

Db 44 TCTGCAAGTTCGGCCCTGTCAACAACAGCACCATGAAAATCGACCACTTTCAGCTAGATA 103 

Qy 2978 atgagaagcccatgcgagtggtggatgatgaagacttggtagaccagcgtctcatcagcg 3037 

IIIIMIIMIIIIIIIIMIIIIIIIIIIIIIIII lllllllllllllllllllllll 

Db 104 ATGAGAAGCCCATGCGAGTGGTGGATGATGAAGACTGGGTAGACCAGCGTCTCATCAGCG 163 
Qy 3038 agctgaggaaagagtacggaatgacctacaatgacttcttcatggtgctaacagatgtgg 3097 

Illlllllll Mlllllll lllllllll III lllllllllllllllllllllllll 

Db 164 AGCTGAGGAAGGAGTACGGATTGACCTACACTGA - TTCTTCATGGTGCTAACAGATGTGG 222 



Qy 



Db 



3098 



223 



atctgagagtcaagcaatactatgaggtaccaataacaatgaagtctgtgtttg-atctg 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIIIII Mill 

ATCTGAGAGTCAAGCAATACTATGAGGTACCAATAACAATGAAGTCTGTGTTTGCATCTG 



3156 



282 



Qy 


3157 


atcgatactttccagtcccgaatcaaagatatggagaagcagaagaaggagggcattgtt 


3216 






IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


283 


ATCGATACTTTCCAGTCCCGAATCAAAGATATGGAGAAGCAGAAGAAGGAGGGCATTGTT 


342 


Qy 


3217 


tgcaaagaggacaaaaagcagtccctggagaacttcctatccaggttccggtggaggagg 


3276 






IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIII 




Db 


343 


TGCT^GAGGACAAAAAGCAGTCCCTGGAGAACTTCCTATCCAGGTTCCGGTGGAGGAGG 


402 


Qy 


3277 


aggttgctggtgatctctgctcctaacgatgaagactgggcctattcacagcagctctct 


3336 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIM 

AGGTTGCTGGTGATCTCTGCTCCTAACGATGAAGACTGGGCCTATTCACAGCAGCTCTCT 462 
gccctcagtggtcaggcgtgcaattttggtctgcgccacataaccattctgaagctttta 3396 

llllllllllllllllllllllllll llllllllllllllllllllllllllll MM 

GCCCTCAGTGGTCAGGCGTGCAATTTGGGTCTGCGCCACATAACCATTCTGAAGC - TTTA 521 

ggcgttggagaggaagttgggggagtgttagaactgttcccaattaatgggagctctgtt 34 56 

llllllllllllllllllllllllll lllllllllllllllllllllllllllllllll 
GGCGTTGGAGAGGAAGTTGGGGGAGTCTTAGAACTGTTCCCAATTAATGGGAGCTCTGTT 581 

gttgagcgagaagacgtaccagcccatttggtgaaagacattcgtaactattttcaagtg 3516 

llllllllllllllllhlllllllllll IIMIIIIIIIIIIIIIIIIIIIIIIIIII 

GTTGAGCGAGAAGACGTWCCAGCCCATTT-GTGAAAGACATTCGTAACTATTTTCAAGTG 640 
agcccggagtacttctccatgcttctagtcggaaaagacggaaatgtcaaatcctggtat 3576 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMI 

AGCCCGGAGTACTTCTCCATGCTTCTAGTCGGAAAAGACGGAAATGTCAAATCCTGGTAT 700 

ccttccccaatgtggtccatggtgattgtgtacgatttaattgattcgatgcaacttcgg 3636 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CCTTCCCCAATGTGGTCCATGGTGATTGTGTACGATTTAATTGATTCGATGCAACTTCGG 760 

agacaggaaatggcgattcagcagtcactggggatgcgctgcccagaagatgagtatgca 3696 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

AGACAGGAAATGGCGATTCAGCAGTCACTGGGGATGCGCTGCCCAGAAGATGAGTATGCA 820 
ggctatggttaccatagttaccaccaaggataccaggatggttaccaggatgactaccgt 3756 

IIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIIMIIIIIIIIIIIIIII 

GGCTATGGTTACCATAGTTACCACCAAGGATACCAGGATGGTTACCAGGATGACTACCGT 880 

catcatgagagttatcaccatggatacccttactgagcagaaatatgtaaccttagactc 3816 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
CATCATGAGAGTTATCACCATGGATACCCTTACTGAGCAGAAATATGTAACCTTAGACTC 94 0 

agccagtttcctctgcagctgctaaaactacatgtggccagctccattcttccacact-g 3875 

I I iiiiiiiiiiiiiiiii mill iiiiiiiiiiiiiiiiiiiiiiii I 

A - - CCATTTCCTCTGCAGCTGCTGGAACTACGATTGGCCAGCTCCATTCTTCCACACTGG 998 
cgtactacatttcctgcctttttctttcagtgtttttctaagactaaataaata 3929 

lllllllllllllll llllllllllll I Mil I I IIIMI 

GGTACTACATTTCCTGGCTTTTTCTTTCAAGGGTTTTTCTTTAGGACTTAAATA 1052 

RESULT 2 
US-08-726-525-3 

Sequence 3, Application US/08726525 
Patent No. 5789181 
GENERAL INFORMATION: 

APPLICANT: Lin, Lih-Ling 
APPLICT^T: Graham, James 

TITLE OF INVENTION: NOVEL INTERLEUKIN-1 RECEPTOR 

TITLE OF INVENTION: INTRACELLULAR LIGAND PROTEINS AND INHIBITORS OF LIGAND 
TITLE OF INVENTION: BINDING 
NUMBER OF SEQUENCES: 7 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LEGAL AFFAIRS, GENETICS INSTITUTE, INC. 
STREET: 87 CambridgePark Drive 
CITY: Cambridge 
STATE : MA 
COUNTRY: USA 
ZIP: 02140 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC -DOS /MS -DOS 
SOFTWARE: Patentin Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/726,525 
FILING DATE: 07-OCT-1996 
CLASSIFICATION: 435 



Db 


403 


Qy 


3337 


Db 


463 


Qy 


3397 


Db 


522 


Qy 


3457 


Db 


582 


Qy 


3517 


Db 


641 


Qy 


3577 


Db 


701 


Qy 


3637 


Db 


761 


Qy 


3697 


Db 


821 


Qy 


3757 


Db 


881 


Qy 


3817 


Db 


941 


Qy 


3876 


Db 


999 



PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/487,942 

FILING DATE: 07-JUN-1995 
ATTORNEY/ AGENT INFORMATION: 

NAME: Brown, Scott A. 

REGISTRATION NUMBER: 32,724 

REFERENCE/DOCKET NUMBER: GI5258 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (617) 498-8224 

TELEFAX: (617) 876-5851 
; INFORMATION FOR SEQ ID NO : 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1088 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : dOXlble 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
HYPOTHETICAL: NO 
FEATURE : 

NAME/KEY: CDS 

LOCATION: 2 , . 961 
US-08-726-525-3 



Query Match 21.0%; Score 863; DB 1; Length 1088; 

Best Local Similarity 85.3%; Pred. No. 6.9e-230; 

Matches 1067; Conservative 0; Mismatches 0; Indels 184; Gaps 

Qy 2694 gaaaaaaggaggtaaaacagaacaggatggctatcagaaacccaccaacaaacacttcac 2753 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 GAAAAAAGGAGGTAAAACAGAACAGGATGGCTATCAGAAACCCACCAACAAACACTTCAC 60 

Qy 2754 gcagagtcccaagaagtcagtggccgacctgctggggtcctttgaaggcaaacgaagact 2813 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMI 

Db 61 GCAGAGTCCCAAGAAGTCAGTGGCCGACCTGCTGGGGTCCTTTGAAGGCAAACGAAGACT 120 



Qy 2 814 ccttctgatcactgctcccaaggctgagaacaatatgtatgtgcaacaacgtgatgaata 2873 

IIIIIIIMIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIII 

Db 121 CCTTCTGATCACTGCTCCCAAGGCTGAGAACAATATGTATGTGCAACAACGTGATGAATA 180 
Qy 2874 tctggaaagtttctgcaagatggctaccaggaaaatctctgtgatcaccatcttcggccc 2933 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 

Db 181 TCTGGAAAGTTTCTGCAAGATGGCTACCAGG7VAAATCTCTGTGATCACCATCTTCGGCCC 24 0 
Qy 2934 tgtcaacaacagcaccatgaaaatcgaccactttcagctagataatgagaagcccatgcg 2 993 

IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 241 TGTCAACAACAGCACCATGAAAATCGACCACTTTCAGCTAGATAATGAGAAGCCCATGCG 300 

Qy 2 994 agtggtggatgatgaagacttggtagaccagcgtctcatcagcgagctgaggaaagagta 3053 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 301 AGTGGTGGATGATGAAGACTTGGTAGACCAGCGTCTCATCAGCGAGCTGAGGAAAGAGTA 360 

Qy 3054 cggaatgacctacaatgacttcttcatggtgctaacagatgtggatctgagagtcaagca 3113 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 361 CGGAATGACCTACAATGACTTCTTCATGGTGCTAACAGATGTGGATCTGAGAGTCAAGCA 420 

Qy 3114 atactatgaggtaccaataacaatgaagtctgtgtttgatctgatcgatactttccagtc 3173 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 421 ATACTATGAGGTACCAATAACAATGAAGTCTGTGTTTGATCTGATCGATACTTTCCAGTC 4 80 

Qy 3174 ccgaatcaaagatatggagaagcagaagaaggagggcattgtttgcaaagaggacaaaaa 3233 

llllllllllllllllllllllllllllllllllllllllllllllll 
Db 481 CCGAATCAAAGATATGGAGAAGCAGAAGAAGGAGGGCATTGTTTGCAA 528 

Qy 3234 gcagtccctggagaacttcctatccaggttccggtggaggaggaggttgctggtgatctc 3293 

Db 529 528 



Qy 

Db 



3294 tgctcctaacgatgaagactgggcctattcacagcagctctctgccctcagtggtcaggc 3353 
529 528 



Qy 


3354 


Db 


529 


Qy 


3414 


Db 


538 


Qy 


3474 


Db 


598 


Qy 


3534 


Db 


658 


Qy 


3594 


Db 


718 


Qy 


3654 


Db 


778 


Qy 


3714 


Db 


837 


Qy 


3774 


Db 


897 


Qy 


3834 


Db 


957 


Qy 


3894 


Db 


1017 



gtgcaattttggtctgcgccacataaccattctgaagcttttaggcgttggagaggaagt 3413 

lllllllll 

AGAGGAAGT 537 

tgggggagtgttagaactgttcccaattaatgggagctctgttgttgagcgagaagacgt 3473 

IIIIIIIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

TGGGGGAGTGTTAGAACTGTTCCCAATTAATGGGAGCTCTGTTGTTGAGCGAGAAGACGT 597 
accagcccatttggtgaaagacattcgtaactattttcaagtgagcccggagtacttctc 3533 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIII 

ACCAGCCCATTTGGTGAAAGACATTCGTAACTATTTTCAAGTGAGCCCGGAGTACTTCTC 657 
catgcttctagtcggaaaagacggaaatgtcaaatcctggtatccttccccaatgtggtc 3593 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIII 

CATGCTTCTAGTCGGAAAAGACGGAAATGTCAAATCCTGGTATCCTTCCCCAATGTGGTC 717 
catggtgattgtgtacgatttaattgattcgatgcaacttcggagacaggaaatggcgat 3653 

IIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIII 

CATGGTGATTGTGTACGATTTAATTGATTCGATGCAACTTCGGAGACAGGAAATGGCGAT 777 
tcagcagtcactggggatgcgctgcccagaagatgagtatgcaggctatggttaccatag 3713 

llllllllllllllllllllllll IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII 

TCAGCAGTCACTGGGGATGCGCTG - CCAGAAGATGAGTATGCAGGCTATGGTTACCATAG 836 

ttaccaccaaggataccaggatggttaccaggatgactaccgtcatcatgagagttatca 3773 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TTACCACCAAGGATACCAGGATGGTTACCAGGATGACTACCGTCATCATGAGAGTTATCA 896 

ccatggatacccttactgagcagaaatatgtaaccttagactcagccagtttcctctgca 3833 

IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIIIIIIIIII 

CCATGGATACCCTTACTGAGCAGAAATATGTAACCTTAGACTCAGCCAGTTTCCTCTGCA 956 
gctgctaaaactacatgtggccagctccattcttccacactgcgtactacatttcctgcc 3893 

IMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIMIIIIIIMIIIIIII 

GCTGCTAAAACTACATGTGGCCAGCTCCATTCTTCCACACTGCGTACTACATTTCCTGCC 1016 

tttttctttcagtgtttttctaagactaaataaatagcaaactttcaccta 3944 

lllllllllllllllllllllllllllllllllllllllllllllllllll 
TTTTTCTTTCAGTGTTTTTCTAAGACTAAATAAATAGCAAACTTTCACCTA 1067 



SUMMARIES 

% 

Result Query- 
No. Score Match Length DB ID * Description 



1 


1273 


.2 


30 


.9 


1679 


11 


AK011256 


AK011256 


Mus muscu 


2 


1022 


.4 


24 


.8 


1550 


11 


BC022704 


BC022704 


Mus muscu 


3 


717 


.8 


17 


.4 


746 


10 


BE889110 


BE889110 


601513557 


4 


713 


17, 


.3 


886 


10 


BG035195 


BG035195 


602324702 


5 


709 


.8 


17, 


.2 


902 


11 


AK009795 


AK009795 


Mus muscu 


6 


704 


.8 


17, 


.1 


857 


10 


BI763566 


BI763566 


603050110 


7 


702, 


.4 


17, 


.1 


738 


10 


BM051031 


BM051031 


603633915 


8 


679. 


.2 


16, 


.5 


818 


10 


BG120262 


BG120262 


602353649 


9 


678, 


.8 


16, 


.5 


1199 


10 


BG253437 


BG253437 


602363350 


10 


671, 


.8 


16, 


.3 


1057 


10 


BF338364 


BF338364 


602034007 


11 


670, 


.2 


16, 


.3 


881 


10 


BI830199 


BI830199 


603072871 


12 


668, 


.8 


16, 


.2 


723 


10 


BG122685 


BG122685 


602351812 


13 


663 


16, 


.1 


832 


10 


BI109173 


BI109173 


602897030 


14 


660, 


.2 


16, 


.0 


806 


10 


BI109220 


BI109220 


602897087 


15 


654, 


.2 


15. 


.9 


659 


9 


AW303375 


AW303375 xvl8cll.x 


16 


651, 


.8 


15. 


.8 


749 


10 


BF037455 


BF037455 


601461038 


17 


651. 


.8 


15. 


.8 


1074 


11 


AK020169 


AK020169 


Mus muscu 



